Goto

Collaborating Authors

 practical machine learning


Practical Machine Learning for Aphasic Discourse Analysis

Pittman, Jason M., Phillips, Anton Jr., Medina-Santos, Yesenia, Stark, Brielle C.

arXiv.org Artificial Intelligence

Analyzing spoken discourse is a valid means of quantifying language ability in persons with aphasia. There are many ways to quantify discourse, one common way being to evaluate the informativeness of the discourse. That is, given the total number of words produced, how many of those are context-relevant and accurate. This type of analysis is called Correct Information Unit (CIU) analysis and is one of the most prevalent discourse analyses used by speech-language pathologists (SLPs). Despite this, CIU analysis in the clinic remains limited due to the manual labor needed by SLPs to code and analyze collected speech. Recent advances in machine learning (ML) seek to augment such labor by automating modeling of propositional, macrostructural, pragmatic, and multimodal dimensions of discourse. To that end, this study evaluated five ML models for reliable identification of Correct Information Units (CIUs, Nicholas & Brookshire, 1993), during a picture description task. The five supervised ML models were trained using randomly selected human-coded transcripts and accompanying words and CIUs from persons with aphasia. The baseline model training produced a high accuracy across transcripts for word vs non-word, with all models achieving near perfect performance (0.995) with high AUC range (0.914 min, 0.995 max). In contrast, CIU vs non-CIU showed a greater variability, with the k-nearest neighbor (k-NN) model the highest accuracy (0.824) and second highest AUC (0.787). These findings indicate that while the supervised ML models can distinguish word from not word, identifying CIUs is challenging.


Investigating the Impact of Quantization on Adversarial Robustness

Li, Qun, Meng, Yuan, Tang, Chen, Jiang, Jiacheng, Wang, Zhi

arXiv.org Artificial Intelligence

Quantization is a promising technique for reducing the bit-width of deep models to improve their runtime performance and storage efficiency, and thus becomes a fundamental step for deployment. In real-world scenarios, quantized models are often faced with adversarial attacks which cause the model to make incorrect inferences by introducing slight perturbations. However, recent studies have paid less attention to the impact of quantization on the model robustness. More surprisingly, existing studies on this topic even present inconsistent conclusions, which prompted our in-depth investigation. In this paper, we conduct a first-time analysis of the impact of the quantization pipeline components that can incorporate robust optimization under the settings of Post-Training Quantization and Quantization-Aware Training. Through our detailed analysis, we discovered that this inconsistency arises from the use of different pipelines in different studies, specifically regarding whether robust optimization is performed and at which quantization stage it occurs. Our research findings contribute insights into deploying more secure and robust quantized networks, assisting practitioners in reference for scenarios with high-security requirements and limited resources.


Bridging Diversity and Uncertainty in Active learning with Self-Supervised Pre-Training

Doucet, Paul, Estermann, Benjamin, Aczel, Till, Wattenhofer, Roger

arXiv.org Artificial Intelligence

This study addresses the integration of diversity-based and uncertainty-based sampling strategies in active learning, particularly within the context of self-supervised pre-trained models. We introduce a straightforward heuristic called TCM that mitigates the cold start problem while maintaining strong performance across various data levels. By initially applying TypiClust for diversity sampling and subsequently transitioning to uncertainty sampling with Margin, our approach effectively combines the strengths of both strategies. Our experiments demonstrate that TCM consistently outperforms existing methods across various datasets in both low and high data regimes.


SUPClust: Active Learning at the Boundaries

Ono, Yuta, Aczel, Till, Estermann, Benjamin, Wattenhofer, Roger

arXiv.org Artificial Intelligence

Active learning is a machine learning paradigm designed to optimize model performance in a setting where labeled data is expensive to acquire. In this work, we propose a novel active learning method called SUPClust that seeks to identify points at the decision boundary between classes. By targeting these points, SUP-Clust aims to gather information that is most informative for refining the model's prediction of complex decision regions. We demonstrate experimentally that labeling these points leads to strong model performance. This improvement is observed even in scenarios characterized by strong class imbalance.


Practical Machine Learning

#artificialintelligence

One of the most common tasks performed by data scientists and data analysts are prediction and machine learning. This course will cover the basic components of building and applying prediction functions with an emphasis on practical applications. The course will provide basic grounding in concepts such as training and tests sets, overfitting, and error rates. The course will also introduce a range of model based and algorithmic machine learning methods including regression, classification trees, Naive Bayes, and random forests. The course will cover the complete process of building prediction functions including data collection, feature creation, algorithms, and evaluation.


Practical Machine Learning in R: Nwanganga, Fred, Chapple, Mike + Free Shipping

#artificialintelligence

Mike Chapple is Teaching Professor of IT, Analytics, and Operations at the University of Notre Dame's Mendoza College of Business where he teaches graduate and undergraduate courses in cybersecurity and business analytics. Prior to joining Notre Dame's faculty, Mike served as Senior Director for IT Service Delivery at the University. In this role, he oversaw the information security, IT compliance, cloud computing, data governance, IT architecture, learning platforms, project management, strategic planning and product management functions for the Office of Information Technologies. Mike led Notre Dame's Cloud First strategy which moved 80% of the institution's IT services into the cloud over three years. Mike previously served as Senior Advisor to the Executive Vice President at Notre Dame for two years.


O'Reilly - Practical Machine Learning for Computer Vision - ch3

#artificialintelligence

Using machine learning models to extract information from images is one of the trickiest ML tasks--but it often yields invaluable insights. What's more, image classification is the "Hello World" of deep learning: It's a stepping stone to other deep learning domains, such as natural language processing. In chapter 3, the authors of Practical Machine Learning for Computer Vision lay out the techniques and model architectures that take advantage of the special properties of images. And did we mention, it's free?


Practical Machine Learning in JavaScript: TensorFlow.js for Web Developers: Gerard, Charlie: 9781484264171: Amazon.com: Books

#artificialintelligence

You'll learn not only theory, but also dive into code samples and example projects with TensorFlow.js. Using these skills and your knowledge as a web developer, you'll add a whole new field of development to your tool set. This will give you a more concrete understanding of the possibilities offered by machine learning. Discover how ML will impact the future of not just programming in general, but web development specifically.

  Industry: Retail > Online (0.40)